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We consider specification and inference for the stochastic scale 
of discretely-observed pure-jump semimartingales with locally stable 
Levy densities in the setting where both the time span of the data 
set increases, and the mesh of the observation grid decreases. The 
estimation is based on constructing a nonparametric estimate for the 
empirical Laplace transform of the stochastic scale over a given inter- 
val of time by aggregating high-frequency increments of the observed 
process on that time interval into a statistic we call realized Laplace 
transform. The realized Laplace transform depends on the activity of 
the driving pure-jump martingale, and we consider both cases when 
the latter is known or has to be inferred from the data. 

1. Introduction. Continuous-time semimartingales are used extensively 
for modeling many processes in various areas, particularly in finance. Typ- 
ically the model of interest is an ltd semimartingale (semimartingale with 
absolute continuous characteristics) given by 

(1.1) dX t = a t dt + a t - dZ t + dY t , 

where at and at are some processes with cadlag paths, Z t is an infinite 
variation Levy martingale and If is a finite variation jump process satis- 
fying certain regularity conditions (all technical conditions on the various 
processes will be given later). The martingale Z% can be continuous (i.e., 
Brownian motion), jump-diffusion or of pure-jump type (i.e., without a con- 
tinuous component). The presence of the last term in (1.1) might appear 
redundant, as Zt can already contain jumps, but its presence will allow us 
to encompass also the class of time-changed Levy processes in our analysis. 
In any case, this last term in (1.1) is dominated over small scales by the 
term involving Zt- 
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Our interest in this paper will be in inference to the process at when Zt is 
a pure-jump Levy process. Pure-jump models have been used to study vari- 
ous processes of interest such as volatility and volume of financial prices [3, 
5], traffic data [21] and electricity prices [18]. 

Parametric or nonpar ametric estimation of a model satisfying (1.1) in the 
pure-jump case is quite complicated for at least the following reasons. First, 
very often the transitional density of Xt is not known in closed-form. This 
holds true, even in the relatively simple case when Xt is a pure-jump Levy 
process. Second, in many situations the realistic specification of at often 
implies that Xt is not a Markov process, with respect to its own filtration, 
and hence all developed methods for estimation of the latter will not apply. 
Third, the various parameters of the model (1.1) capture different statistical 
properties of the process Xt and hence will have various rates of convergence 
depending on the sampling scheme. For example, in general, at and the tails 
of Zt can be estimated consistently only when the span of the data increases, 
whereas the so-called activity of Zt can be recovered even from a fixed-span 
data set, provided the mesh of the latter decreases; see [6] for estimation 
of the activity from low- frequency data set. Finally, the simulation of the 
process Xt can be, in many cases, difficult or time consuming. 

In view of the above-mentioned difficulties, our goal here is specifica- 
tion analysis for only part of the model, mainly the process at, in the case 
when Xt is a pure-jump process following (1.1). We conduct inference in 
the case when we have a high-frequency data set of Xt with increasing time 
span; see [6] and [22] for inference about jump processes based on low fre- 
quency. We refer to at as the stochastic scale of the pure-jump process X t 
in analogy with the scale parameter of a stable process, that is, when Xt is 
a stable process then the constant a is the scale of the process, at is key in 
the specification of (1.1) and, in particular, it captures the time- variation of 
the process Xt over small intervals of time. Our goal here will be to make 
the inference about at robust to the rest of the components of the model, 
that is, the specification of at and Yt, as well as the dependence between at 
and Zt- 

The inference in the paper is for processes for which the Levy measure of 
the driving martingale Zt in (1.1) behaves around zero like that of a stable 
process. This covers, of course, the stable process, but also many other Levy 
processes of interest with details provided in Section 2 below. The idea of 
our proposed method of inference is to use the fact that when Zt is locally 
stable, the leading component of the process Xt over small scales is governed 
by that of the "stable component" of Zt- Moreover, when at is an Ito semi- 
martingale, then "locally" its changes are negligible and at can be treated 
as constant. Intuitively then infill asymptotics can be conducted as if the 
increments of Xt are products of (a locally constant) stochastic scale and 
independent i.i.d. stable random variables. This, in particular, implies that 
the empirical characteristic function of the high-frequency increments over 
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a small interval of time will estimate the characteristic function of a scaled 
stable process. The latter, however, is the Laplace transform for the lo- 
cally constant stochastic scale. Therefore, aggregating over a fixed interval 
time the empirical characteristic function of the (appropriately re-scaled) 
high-frequency increments of Xt provides a nonparametric estimate for the 
empirical Laplace transform of the stochastic scale over that interval. We 
refer to this simple statistic as realized Laplace transform for pure-jump 
processes. The connection between the empirical characteristic function of 
the driving martingale and the Laplace transform of the stochastic scale 
in the context of time-changed Levy processes, with time-change indepen- 
dent of the driving martingale, has been previously used for low-frequency 
estimation in [7]. 

The inference based on the realized Laplace transform is robust to the 
specification of at, as well as the tail behavior of Zt. Intuitively, this is 
due to our use of the high-frequency data whose marginal law is essentially 
determined by the small jumps of Z% and the stochastic scale at- Quite 
naturally, however, our inference depends on the activity of the small jumps 
of the driving martingale Zt - The latter corresponds to the index of the stable 
part of Z, and using the self-similarity of the stable process, it determines its 
scaling over different (high) frequencies. Therefore, the activity index enters 
directly into the calculation of the realized Laplace transform. We conduct 
inference, both in the case where the activity is assumed known, and when it 
needs to be estimated from the data. The estimation of the activity index, 
however, differs from the inference for the stochastic scale. While for the 
latter we need, in general, the time-span to increase to infinity (except for 
the degenerate case when at is actually constant), for the former, this is 
not the case. The activity index can be estimated only with a fixed span 
of high-frequency data, and, in general, increasing time-span will not help 
for its nonparametric estimation. Therefore, we estimate the activity index 
of Zt using the initial part of the sample with a fixed span and then plug it 
into the construction of the realized Laplace transform. We further quantify 
the asymptotic effect from this plug-in approach on the inference for the 
Laplace transform of the stochastic scale. 

The Laplace transform of the stochastic scale preserves the information 
for its marginal distribution. Therefore, it can be used for efficient estima- 
tion and specification testing. We illustrate this in a parametric setting by 
minimizing the distance between our nonparametric Laplace estimate and 
a model implied one, which is similar to estimation based on the empirical 
characteristic function, as in [13]. 

Finally, the current paper studies the realized Laplace transform for the 
case when Zt is pure-jump, while [29] (and the empirical application of it 
in [30]) consider the case where Zt is a Brownian motion. The pure-jump case 
is substantively different, starting from the very construction of the statistic 
as well as its asymptotic behavior. The leading component in the asymptotic 
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expansions in the pure-jump case is a stable process with an index of less 
than 2, and this index is, in general, unknown and needs to be estimated, 
which further necessitates different statistical analysis from the continuous 
case. Also, the residual components in Xt, like at, play a more prominent 
role when Xt is of pure-jump type, and when the activity is low this requires 
modifying appropriately the realized Laplace transform to purge them. 

The paper is organized as follows. Section 2 presents the formal setup 
and assumptions. Section 3 introduces the realized Laplace transform and 
derives its limit behavior. In Section 4 we conduct a Monte Carlo study, 
and in Section 5 we present a parametric application of the developed limit 
theory. Section 6 concludes. The proofs are given in Section 7. 

2. Setting and assumptions. Throughout the paper, the process of in- 
terest is denoted with Xt and is defined on some filtered probability space 
(Q, T , (J r t)t>o,P). Before stating our assumptions, we recall that a Levy pro- 
cess Lt with characteristic triplet (b, c, v) with respect to truncation function 
k(x) = x (we will always assume that the process has a finite first order mo- 
ment) is a process with characteristic function given by 



With this notation, we assume that the Levy process Z% in (1.1) has a charac- 
teristic triplet (0, 0, v) for v some Levy measure. Note that since the trunca- 
tion function with respect to which the characteristics of the Levy processes 
are presented is the identity, the above implies that Zt is a pure-jump mar- 
tingale. The first term in (1.1) is the drift term. It captures the persistence 
in the process, and when Xt is used to model financial prices, the drift cap- 
tures compensation for risk and time. The second term in (1.1) is defined 
in a stochastic sense, since in Assumption A below, we will assume that Zt 
is of infinite variation. The last term in (1.1) is a finite variation pure-jump 
process. Assumption A below will impose some restrictions on its proper- 
ties, but we stress that there is no assumption of independence between the 
processes at, Z t and Y t . 

In the pure-jump model the jump martingale, Zt substitutes the Brownian 
motion used in jump-diffusions to model the "small" moves. We note that 
the "dominant" part of the increment of Xt over a short interval of time 
(t,t + A) is a t - x (Z t+A -Z t ). This term is of order O p (A a ) for a £ [1/2,1), 
while the rest of the components of Xt are at most O p (A) when A 1 0. 

We recall from the introduction that our object of interest in this paper is 
the stochastic scale of the martingale component of Xt, that is, at- Of course 
we observe only Xt and at is hidden into it, so our goal in the paper will be 
to uncover at, and its distribution in particular, with assuming as little as 
possible about the rest of the components of X t and the specification of at 




REALIZED LAPLACE TRANSFORMS FOR PURE-JUMP 



5 



itself (including the activity of the driving martingale). Given the preced- 
ing discussion, the scaling of the driving martingale components over short 
intervals of time will be of crucial importance for us, as, at best, we can ob- 
serve only a product of the stochastic scale with Zfj+A- Our Assumption A 
below characterizes the behavior of Zt and Yj over small scales. 

Assumption A. The Levy density of Zt, v, is given by 

A f 

(2.2) u(x) = , +v f (x), /3G(1,2), / \x\v(x)dx <oo, 

Fr + Jr 

where 

(2.3) A- p -™"%' m y\ KM 

and further there exists xq > such that for \x\ < xq we have \u'(x)\ < 

for some f3' < 1 and a constant C > 0. 

We further have Yt absolutely integrable and E|Yf — Y" s |^ < C\t — s\\ log \t — 
s\\ for every t,s>0 with \t — s\ < 1, some positive constant C , and /3' < 1 
being the constant above. 

Assumption A implies that the small scale behavior of the driving martin- 
gale Zt is like that of a stable process with index f3. The index f3 determines 
the "activity" of the driving process, that is, the vibrancy of its trajectories, 
and thus henceforth we will refer to it as the activity. Formally, (3 equals the 
Blumenthal-Getoor index of the Levy process Zt . The value of the index (3 
is crucial for recovering at from the discrete data on Xt, as intuitively it 
determines how big on average the increments Ztt+A should be for a given 
sampling frequency. The following lemma makes this formal. 

Lemma 1. Let Zt satisfy Assumption A. Then for h — > 0, we have 

(2.4) h -W Zth -^S t , 

where the convergence is for the Skorokhod topology on the space of cadlag 
functions. M + — > M, and St is a stable process with characteristic function 
E ( e iuSi) = e -|u|"/2 < 

The value of the constant A in (2.3) is a normalization that we impose. 
We are obviously free to do that since what we observe is X t , whose lead- 
ing component over small scales is an integral of at, with respect to the 
jump martingale Zt, and we never observe the two separately. The above 
choice of A is a convenient one that ensures that when f3 — > 2, the jump 
process converges finite-dimensionally to Brownian motion. We note that in 
Assumption A we rule out the case /3 < 1, but this is done for brevity of 
exposition, as most processes of interest are of infinite variation (although 
we rule out some important processes like the generalized hyperbolic). 
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In Assumption A we restrict the "activity" of the "residual" jump com- 
ponents of Xt] that is, we limit their effect in determining the small moves 
of X. The effect of the "residual" jump components on the small moves is 
controlled by the parameter j3' . Prom (2.2), the leading component of v(x) 
is the Levy density of a stable, and v'{x) is the residual one. The restriction 
ft < 1, implies that the "residual" jump component is of finite variation. 
This restriction is not necessary for convergence in probability results (only 
f3' < f3 is needed for this), but is probably unavoidable if one needs also the 
asymptotic distribution of the statistics that we introduce in the paper. In 
most parametric models this restriction is satisfied. 

We note that v'(x) in (2.2) is a signed measure, and therefore Assump- 
tion A restricts only the behavior of v{x) for x ~ to be like that of a stable 
process. However, for the big jumps, that is, when |x| > K for some arbitrary 
K > 0, the stable part of v{x) can be completely eliminated or tempered by 
negative values of the "residual" v'(x). An example of this, which is cov- 
ered by our Assumption A, is the tempered stable process of [24], generated 
from the stable by tempering its tails, which has all of its moments finite. 
Therefore, while Assumption A ties the small scale behavior of the driving 
martingale Zt with that of a stable process, it leaves its large scale behavior 
unrestricted (i.e., the limit of h~ a Zih for some a > when h — > oo is unre- 
stricted by our assumption) and thus, in particular, unrelated with that of 
a stable process. 

Remark 1. Assumption A is analogous to the assumption used in [2]. 
It is also related to the so-called regular Levy processes of exponential type, 
studied in [9], with f3 = v in the notation of that paper. Compared with the 
above mentioned processes of [9], we impose slightly more structure on the 
Levy density around zero but no restriction outside of it. We note that if 
Assumption A fails, then the results that follow are not true. The degree of 
the violation depends on the sampling frequency and the deviation of the 
characteristic function of Z over small scales from that of a stable. 

Finally, the process Yt also captures a "residual" jump component of Xt 
in terms of its small scale behavior. Assumption A limits its activity by f3' . 
The component of Z corresponding to v'{x) and Yt control the jump measure 
of Xt away from zero. Unlike the former, whose time variation is determined 
by <7j, the latter has essentially unrestricted time variation. There is clearly 
some "redundancy" in the specification in (1.1) in terms of modeling the 
jumps of Zt away from zero, but this is done to cover more general pure- 
jump models, as we make clear from the following two remarks. 

Remark 2. Assumption A nests time-changed Levy processes with ab- 
solute continuous time changes (see, e.g., [11]), that is, specifications of the 
form 



(2.5) 
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where \i is a random integer- valued measure with compensator at dt ® v{dx) 
for some nonnegative process at and Levy measure v{dx) satisfying (2.2) of 
Assumption A and /I = fi — v. This can be shown using Theorem 14.68 of [14] 
or Theorem 2.1.2 of [15], linking integrals of random functions with respect to 
Poisson measure and random integer- valued measures, and implies that Xt 
in (2.5) can be equivalently represented as (1.1) with at given by a\^ . 

On the other hand, if we start with X given by (1.1), with Z strictly 
stable and no Y, we can show, using the definition of jump compensator 
and Theorem II. 1.8 of [16], that the latter is a time-changed Levy process 
with time change |<7t_|^. For more general "stable-like" Levy processes, we 
need to introduce an additional term [this is Yj in (1.1)] in addition to the 
above time-changed stable process. 

We note that the connection of (1.1) with the time-changed Levy processes 
does not depend on the presence of any dependence between at and Z t . 

Remark 3. Assumption A is also satisfied by the pure-jump Levy- 
driven CARMA models (continuous-time autoregressive moving average) 
which have been used for modeling series exhibiting persistence; see, for ex- 
ample, [10] and the many references therein. For these processes at in (1.1) 
is a constant. 

Our next assumption imposes minimal integrability conditions on at and at 
and further limits the amount of variation in these processes over short peri- 
ods of time. Intuitively, we will need the latter to guarantee that by sampling 
frequently enough we can treat "locally" at (and at) as constant. 

Assumption B. The process at is an ltd semimartingale given by 

(2.6) at = ao + / a s ds + a s dW s + / / 5(s—,x)fl(ds,dx), 
Jo Jo Jo Jr ~ 

where W is a Brownian motion; fj, is a homogenous Poisson measure, with 
Levy measure v_(dx), having arbitrary dependence with fi, for fi being the 
jump measure of Z. We assume \S(t,x)\ < 7t^(x) for some integrable pro- 
cess 7t and f R (5(x) A l)' 3 u(dx) < oo for some /3" < 2. Further, for every t 
and s, we have 



(2.7) 



E^a 2 t + al + a\ + a 2 t + {a' t f + J 5 2 (t,x)v(dx)^J < C, 

EMaj - a s \ 2 + \a t - a s \ 2 + J (5(t, x) - S(s, x)) 2 u(dx)j <C\t-s\, 

where C > is some constant that does not depend on t and s. 

Assumption B imposes at to be an ltd semimartingale. This is a rel- 
atively mild assumption satisfied by the popular multifactor affine jump- 
diffusions [12] as well as the CARMA Levy-driven models used to model 
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persistent processes [10]. Assumption B rules out certain long- memory spec- 
ifications for at, although we believe that, at least for some of them, the 
results in this paper will continue to hold. 

Importantly, however, Assumption B allows for jumps in the stochastic 
scale that can have arbitrary dependence with the jumps in Xt which is par- 
ticularly relevant for modeling financial data, for example, in the parametric 
models of [17]. Finally, the second part of (2.7) will be satisfied when the 
corresponding processes are ltd semimartingales. The next Assumption B' 
restricts Assumption B in a way that will allow us to strengthen some of the 
theoretical results in the next section. 

Assumption B'. The process at is an ltd semimartingale given by 

(2.8) a t = a + / a s ds+ / a s dW s + / a' s dZ s + I I 6(s—,x)p L (ds,dx), 
Jo Jo Jo Jo Jr 

with the same notation as Assumption B, with the only difference being 
that fi is now independent from /i. We also assume that corresponding con- 
dition (2.7) holds as well as that Zt is square-integrable. 

The strengthening in Assumption B' is in the modeling of the dependence 
between the jumps in Xt and at- In Assumption B', this is done via the third 
integral in (2.8). This is similar to modeling dependence between continuous 
martingales using correlated Brownian motions. What is ruled out by As- 
sumption B' is dependence between the jumps in Xt and at that is different 
for the jumps of different size. Assumption B' will be satisfied when the pair 
(Xt,at) are modeled jointly via a Levy-driven SDE. 

Finally, in our estimation we make use of long-span asymptotics for the 
process at and the latter contains temporal dependence. Therefore, we need 
a condition on this dependence that guarantees that a central limit theorem 
for the associated empirical process exists. This condition is given next. 

Assumption C. The volatility at is a stationary and a-mixing process 
with a™ lx = 0(i~ 3_t ) for arbitrarily small t > when t — > oo, where 

af x = sup \P(AnB)-P(A)P(B)\, 

^o = o-(a s ,s<0) and = a(a s ,s > t). 

3. Limit theory for RLT of pure-jump semimartingales. Now we are 

ready to formally define the realized Laplace transform for the pure-jump 
model and derive its asymptotic properties. We assume the process Xt is 
observed at the equidistant times 0, A n , . . . ,iA n , . . . , [T/A n ] where A n is 
the length of the high-frequency interval, and T is the span of the data. The 
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realized Laplace transform is then denned as 

[T/A n ] 



V T (X,A n ,p,u)= A n cos((2 U ) 1 ^A^ 1 //3 A n x) 



AfX = XiA n - -X"(i-i)A. 



n • 



where j3 is the activity index of the driving martingale Z% given in its Levy 
density in (2.2). Vt(X, A n ,/3,u) is the real part of the empirical characteris- 
tic function of the appropriately scaled increments of the process Xt . In the 
case of jump-diffusions, (3 in (3.1) is replaced with 2 as the activity of the 
Brownian motion has an index of 2 (i.e., for the Brownian motion, Lemma 1 
holds with P replaced by 2). We show in this section that Vt(X, A n , f3,u)/T 
is a consistent estimator for the empirical Laplace transform of \&t\ an d 
further derive its asymptotic properties under various sampling schemes 
as well as assumptions regarding whether (3 is known or needs to be es- 
timated. 

3.1. Fixed-span asymptotics. We start with the case when T is fixed and 
A n — > 0, that is, the infill asymptotics, and we further assume we know (3. 
Since the driving martingale over small scales behaves like /3-stable (As- 
sumption A) and the stochastic scale changes over short intervals are not 
too big on average (Assumption B), then the "dominant" part (in a infill 
asymptotic sense) of the increment A™X (when A n is small) is &(i—i)A, n AfZ. 
A"Z is approximately stable, and from Lemma 1, we have approximately 

A?Z i A 1 ^ x Z\ with the characteristic function of Z\ given by e ' u ' ' . 
Therefore, for a fixed T, Vr(X, A n , /3, u) is approximately a sample average 
of a heteroscedastic data series. Thus, by the law of large numbers (when 
A n — > 0), it will converge to f Q e~ u \' Tt ^ ds, which is the empirical Laplace 
transform of \crt\^ after dividing by T. The following theorem gives the pre- 
cise infill asymptotic result. In it we denote with C — s convergence stable in 
law, which means that the convergence in law holds jointly with any random 
variable defined on the original probability space. 

Theorem 1. For the process Xt, assume that Assumptions A and B 
hold with j3' < f3/2, and let A n — > with T fixed. 

(a) If (3 > 4/3, then we have 



(3.2) 
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where E is a standard normal variable defined on an extension of the original 

-rf- 1 ^ _ 2 -xP , i 

probability space and independent from the a -field J- ; Fp(x) = 2 

for x > 0. 

A consistent estimator for the asymptotic variance is given by 
(33) MX, An, P, 2P- l u) - 2V T (X, A n , p,u) + l 

(b) ///3<4/3, then 

(3.4) (v T (X,A n ,fJ,u)- £e~ u ^ds^ = O p (A 2 n ~ 2 ^). 

In the case when Xt is a Levy process, or, more generally, when only 
the scale at is constant as is the case for the Levy-driven CARMA models, 
the above theorem can be used to estimate the scale coefficient a by either 
fixing some it or using a whole range of it's as in the methods for estimation 
of stable processes based on the empirical characteristic function; see, for 
example, [19] and [1]. Furthermore, this can be done jointly with the non- 
parametric estimation of (3 by using for example the estimator we proposed 
in [27] that we define in (3.17) below. 

The limit result in Theorem 1 is driven by the small jumps in Xt, and 
this allows us to disentangle the stochastic scale (which drives their temporal 
variation) from the other components of the model, mainly the jumps away 
from zero. This is due to the fact that the cosine function is bounded and 
infinitely differentiable which limits the effect of the jumps of size away 
from zero on it. By contrast, for example, the infill asymptotic limit of the 
quadratic variation of the discretized process is the quadratic variation of Xt 
which is determined by all jumps, not just the infinitely small ones. 

Unfortunately when the activity of the driving martingale is relatively 
low, i.e., /3 < 4/3, we do not have a CLT for Vt(X, A re ,/3,it). The reason 
is in the presence of the drift term, which for the purposes of our estima- 
tion starts behaving closer to the driving martingale Zt and this slows the 
rate of convergence of our statistic. However, we can use the fact that over 
successive short intervals of time the contribution of the drift term in the 
increments of Xt is the same while sum or difference of i.i.d. stable random 
variables continues to have a stable distribution. Therefore, if we difference 
the increments of Xt, we will remove the drift term (up to the effect due 
to the time-variation in it which will be negligible) and the leading term 
will still be a product of the locally constant stochastic scale and a stable 
variable. Thus we consider the following alternative estimator: 

[T/A n ] 

(3.5) V T (X,A n ,(3,u) = ]T AnCos^A-^A-X-Atx*)]. 

i=2 



REALIZED LAPLACE TRANSFORMS FOR PURE-JUMP 



11 




Theorem 2. For the process Xt, assume that Assumptions A and B 
hold with P' < (3/2, and let A n — > with T fixed. We have 

-±=(v T (X,A n ,p,u)- f T e- u ^ds) 
(3.6) ^ V J ° J 

Fp{uW\a s \)dsxE, 

o 

where E is a standard normal variable defined on an extension of the original 
probability space and independent from the a -field T '; Fp{x) = (- ^ — ~) 2 + 

2 e -^^f!±l- 3e -^ + ( el!!^l ) 2 forx>0 , 

A comparison of the standard errors in (1) and (2) shows that the latter 
can be up to 2.5 times higher than the former for values of /? > 4/3. This 
is the cost of removing the effect of the drift term via the differencing of 
the increments. Therefore, Vt{X, A n , /3, u) should be used only in the case 
when /3 < 4/3. For brevity, the results that follow will be presented only for 
Vt(X, A n , j3, u), but analogous results will hold for Vr(X, A n , (3, u). 



3.2. Long span asymptotics: The case of known activity. We continue 
next with the case when the time span of the data increases together with 
the mesh of observation grid decreasing. The high-frequency data allows 
us to "integrate out" the increments A"Z, that is, it essentially allows 
to "deconvolute" at from the driving martingale of Xt in a robust way. 
After dividing by T, the infill asymptotic limit of (3.1) is the empirical 
Laplace transform of the stochastic scale and we henceforth denote it as 
Cp(u) = ^V T (X,A n ,P,u). Then, by letting T — > oo we can eliminate the 
sampling variation due to the stochastic nature of at, and thus recover its 
population properties, that is, estimate Cp(u) = E(e _- "' cri ^) which is the 
Laplace transform of \at\^ ■ 

The next theorem gives the asymptotic behavior of Cp{u) when both T — > 
oo and A n — > 0. To state the result we first introduce some more notation. 
We henceforth use the shorthand 

(3.7) Z t} p(u) = Vt(X,A n ,/3,u)-V t -i(X,A n ,/3,u), t = l,...,T, 
for the RLT over the time interval (t— l,t). We further set 

1 T ^ ^ 

(3.8) C k , p {u,v) = - J2 {Z t ,p(u)-Cp{u))(Z t _ k ^{v)-C^v)), keK 

t=k+l 
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Theorem 3. Suppose T — > oo and A n — > 0, and the process X t satisfies 
Assumptions A, B and C. 

(a) We have 

(3.9) y/T{Cp{u) - C p {u)) = Y£\u) + Y} 2) 

(3.10) 



T \ U ) 



{ Y?\u) = O p (VT(\ log A n | A^ V A™^ 2 ) V v^), 

where the result for Yj, ' (u) holds locally uniformly in u, and the convergence 

of Yj}\u) is on the space C(R+) of continuous functions indexed by u and 
equipped with the local uniform topology (i.e., uniformly over compact sets 
ofu£~$l + ), and ty(u) is a Gaussian process with variance-covariance for 
u, v > given by 

/•oo 

(3.H) J ° 

+ ( e -»M" _ Cp{v)){e- U \^ - C p {u))} dt. 

If we strengthen Assumption B to Assumption B' , we get the stronger 
y£\u) = O p (Vf(\ log A n |A^ v Al~^) V yfa). 



(b) For arbitrary integer k>l and every u, v > we have 

(3.12) Z ltP (u)Z ktP (v) A C [ k e-ktlVW dsdt. 

//j further, Lt is a deterministic sequence of integers satisfying — > as 
T — » oo and L T (\ log A n | A* _/3 ' //3 V a! 2_2//3)a1/2 ) -> 0, we have 

— > Zp(u,v), 

where uj{i,Lx) is either a Bartlett or a Parzen kernel. 

The result in (3.9) holds locally uniformly in u. This is important as, in 
a typical application, one needs the Laplace transform as a function of u. We 
illustrate, in the next section, an application of the above result to paramet- 
ric estimation that makes use of the uniformity. We note also that Yip(u,v) 
is well defined because of Assumption C; see [16], Theorem VIII. 3. 79. 
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Under the conditions of Theorem 3, the scaled and centered realized 
Laplace transform can be split into two components, Y^}\u) and Y,j?\u), 
that have different asymptotic behavior and capture different errors in- 
volved in the estimation. The first one, Y^\u), equals VT(± £ e~ u ^ dt- 
E(e -u ' <Tt ' 8 )), which is the empirical process corresponding to the case of 
continuous-record of Xt in which case \<Jt\^ can be recovered exactly. Hence 
the magnitude of Y^\u) is the sole function of the time span T. On the 

(2) 

other hand, the term Yj, (it) captures the effect from the discretization er- 
ror, that is, the fact that we use high-frequency data and not continuous 

(2) 

record of Xt in the estimation. For Yj, (u) to be negligible, we need a con- 
dition for the relative speed of A n — > and T — > oo which, in the general 
case of Assumption B, is given by y/T{\ log A n | A?T^ V A^ 2 " 2//3)A1/2 ) -»• 0. 

The relative speed condition is driven by the biases that arise from using 
the discretized observations of Xt- The martingale term that determines the 
limit behavior of the statistic for a fixed span in Theorem 1 is dominated 
by the empirical process error Y^\u) when the time span increases. The 
leading biases due to the discretization are two: the drift term at and the 
presence of "residual" jump components in Xt in addition to its leading sta- 
ble component at high frequencies. The bias in £p{u) due to the "residual" 

jump components is O p (\ log A n |A^ ^ The higher the activity of the 
"residual" jump components is, the stronger their effect is on measuring the 
Laplace transform of \&t\ ■ Typically, ft' will be determined from a Taylor 
expansion of the Levy density of the driving martingale around zero. In this 
case ft' = ft — 1 and the bias will be bigger for the higher levels of activity ft. 

2—2 / 8 

The bias due to the drift term is O p (A n ), and it becomes bigger the 
lower the activity is of the driving martingale. This bias can be significantly 
reduced if we make use of Vr(X, A n , ft,u) when estimating Cp{u). Finally, 
the orders of magnitude of the above biases can be shown to be optimal by 
deriving exactly the bias in the simple case (covered by our Assumption A) 
in which Xt is Levy and further Zt is a sum of two independent stable 
processes with indexes ft and ft' . 

The relative speed condition here can be compared with the corresponding 
one that arises in the problem of maximum likelihood estimation of Markov 
jump-diffusions; see, for example, [26]. The general condition in this problem 
is TA n — > (also known as the rapidly increasing experimental design); that 
is, the mesh of the grid should increase somewhat faster than the time span 
of the data. In our problem here we need weaker relative speed condition, 
provided we use the stronger Assumption B' and the deviation of Zt from 
a stable process at high frequencies is not too big; that is, ft' is relatively 
low. 



14 



V. TODOROV AND G. TAUCHEN 



Part b of Theorem 3 makes the limit result in (3.10) feasible; that is, it 
provides estimates from the high-frequency data for the asymptotic variance 
of the leading term Y^\u). The first result in it, that is, the limit in (3.12), 
is of independent interest. The sample average of the limit in (3.12) es- 
sentially identifies the integrated joint Laplace transform of |<7t| . This is 
a natural extension of our results here for the marginal Laplace transform 
of \ut\^ and can be used for estimation and testing of the transitional density 
specification of the stochastic scale. We do not pursue this any further here. 

Finally, the proof of Theorem 3 implies also that Cp{u) converges to £p{u) 
in Li(R_|_,o;) where uj{u) is a bounded nonnegative-valued weight function 
with uj{u) = o{u~ l ~ L ) when u — > oo for arbitrarily small i > 0. This can be 
used to invert Cp{u), using regularized kernels as those of [20], to estimate 
nonparametrically the density of the stochastic scale. 

3.3. Long span asymptotics: The case of estimated activity. The asymp- 
totic results in Theorem 3 rely on the premise that f3 is known. The realized 
Laplace transform crucially relies on f3, as the latter enters not only in its 
asymptotic limit and variance but also in its construction. If we put a wrong 
value of f3 in the calculation of the realized Laplace transform, then it is easy 
to see that £r(u) will converge either to 1 or depending on whether the 
wrong value is above or below the true one, respectively. 

In this section we provide asymptotic results for the case where the ac- 
tivity f3 needs to be estimated from the data. Developing an estimate for /? 
from the high-frequency data is relatively easy (we will give an example at 
the end of the section). Hence, here we investigate the effect of estimating f3 
on our asymptotic results in Theorem 3. 

Theorem 4. Suppose there exists an estimator of ft, denoted with (3 and 
Assumptions A, B and C hold. 

— /\ a 

(a) If f3 — f3 = o p (-^=) for some a > 0, then we have 

(3.14) VT{C^{u)-C l3 {u)) = o p 

(b) If f3 uses only information before the beginning of the sample or an 
initial part of the sample with a fixed time-span (i.e., one that does not grow 
withT), and further (3- (3 = O p (A%) for a > 1/(2 (3), /3'</3/2, /3 > 4/3 and 
TA n — > 0, then we have (locally uniformly in u) 

(3.15) VT(C^(u) - Cpiu)) j 2 (0-0)— ►(), 

where Gg(x) = (3x^e~ xf> for x > 0. 
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(c) Under the conditions of part (b), a consistent estimator for 
E(Gfl(uo"t)) is given by 

= =? E ((2n) 1 ^A;^ A n X)sin((2n) i//3 A -i//3 A n x) 

(3.16) 

Unlike the estimation of £p{u), which requires both A n — > and T — > oo, 
the estimation of /3 can be performed with a fixed time span by only sampling 
more frequently. Therefore, typically the error /3 — B will depend only on A n . 
Thus, in the general case of part (a) of the theorem, we will need the relative 
speed condition TA^ — > for some 7 > to guarantee that the estimation 
of B does not have an asymptotic effect on the estimation of the Laplace 
transform of the stochastic scale. By providing a bit more structure, mainly 
imposing the restriction that /3 is estimated by a previous part of the sample 
or an initial part of the current sample with a fixed time span, we can derive 
the leading component of the introduced error in our estimation. This is 
done in part (b) of the theorem, where it is shown that the latter is a linear 
function of B — B (appropriately scaled). As we mentioned earlier, B does not 
need long span, just sampling more frequently, that is, A n — > 0. Therefore, in 
a practical application one can estimate (3 from a short period of time at the 
beginning of the sample and use the estimated fB and the rest of the sample 
(or the whole sample) to estimate the Laplace transform of the stochastic 
scale. In such a case, part (b) allows us to incorporate the asymptotic effect 
of the error in estimating B into calculation of the standard errors for Cp{u). 
For this, one needs to note that the errors in (3.9) and (3.15) in such case 
are asymptotically independent. 

A more efficient estimator, in thejsense offaster rate of convergence, will 
mean that the approximation error C^(u) — C/j(u) will be smaller asymptot- 
ically. Finally, the lower bound on a in part(b) of the above theorem would 
typically be satisfied when /3' < (3/2. We finish this section by providing an 
example of \J A n -consistent nonparametric estimator of f3 (when /?' < /3/2), 
developed in [27]. The estimation is based on a ratio of power variations 
over two time scales for optimally chosen power. It is formally defined as 

f3 i7) 8= ^ 

K ' } P ln(2)+ln[$ T (X,p*,2A n )]-ln[$ T (X,p*,A n )]' 

where p* is optimally chosen from a first-step estimation of the activity, and 
the power variation $^(X,p, A n ) is defined as 

P7A„] 

(3.18) f> T (X,p,A n )= \^iX\ p . 

i=i 
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It is shown in [27] that /3 in (3.17) is \J A n -consistent for T fixed with an 
associated feasible central limit theorem also available. 

4. Monte Carlo assessment. We now examine the properties of the es- 
timators of the Laplace transform both in the case when activity of the 
driving martingale is known or needs to be estimated from the data, Cp{u) 
and Cp{u), respectively. The Monte Carlo setup is calibrated for a finan- 
cial price series. In particular, we use 1000 Monte Carlo replications of 1200 
"days" worth of 78 within-day price increments and this corresponds ap- 
proximately to the span and the sampling frequency of our actual data set 
in the empirical application. The model used in the Monte Carlo is given 
by 

(4.1) dX t = V t 1/P dL t , dV t = 0.02(1.0- V t ) dt + Qm^/V t dB u 

where Lt is a Levy process with characteristic triplet (0,0, u) for v{x) = 

, n + 1 7 or = 0,1 mi+i 2 ? X ■ The first choice of the Levy measure corre- 
al 1 \ x \ 

sponds to that of a stable process with activity index of 1.7 while the second 
one is that of a tempered stable process with the same value of the activity 
index. For the second choice of v(x), Assumption A is satisfied with f3' = 0.7, 
which indicates a rather active "residual" component in the driving martin- 
gale, in addition to its stable part. Therefore, the second case represents 
a very stringent test for the small sample behavior of the RLT. 

Table 1 summarizes the outcome of the Monte Carlo experiments. The 
first two columns of the table report the results for the case when the ac- 
tivity is known and fixed at its true value. In both cases, the estimate is 
very accurate and virtually unbiased. The third column presents the results 
for the case when the inference is done with (3 = 2 (with Lt being tem- 
pered stable) which corresponds to treating erroneously the process Xt as 
a jump-diffusion. As seen from Table 1 this results in a rather nontrivial 
upward bias. The reason is that in forming the realized Laplace transform 

1/17 

the increments should be inflated by the factor A n ' but they are in- 

— 1/2 

stead inflated by the much smaller A n . Using the under-inflated incre- 
ments in the computations induces a very large upward bias in the estima- 
tor. 

The last two columns of Table 1 summarize the Monte Carlo results for 
the case where the index /3 is presumed unknown and estimated using (3.17) 
based on the first 252 "days" in the simulated data set. As to be be expected, 
the estimator of the Laplace transform is less accurate than when the activity 
is known. In the case when the driving martingale is tempered stable, our 
measure becomes slightly biased due to a small bias in the estimate of the 
activity level /3. These biases, however, are relatively small when compared 
with the standard deviation of the estimator. 



REALIZED LAPLACE TRANSFORMS FOR PURE- JUMP 17 







Table 


1 










Monte Carlo 


results 








S 


TS 


TS 


S 


TS 




Fixed at 


Fixed at 


Fixed at 


Estimated 


Estimated 


Activity 


true value 


true value 


= 2 






true value 


0.9051 


0.9051 


£,3(0.10) 
0.9051 


0.9051 


0.9051 


mean 


0.9052 


0.9085 


0.9390 


0.9057 


0.9137 


std 


0.0063 


0.0065 


0.0045 


0.0074 


0.0072 


true value 


0.6112 


0.6112 


£,3(0.50) 
0.6112 


0.6112 


0.6112 


mean 


0.6111 


0.6159 


0.7771 


0.6141 


0.6434 


std 


0.0208 


0.0218 


0.0145 


0.0292 


0.0284 


true value 


0.3001 


0.3001 


M1.25) 
0.3001 


0.3001 


0.3001 


mean 


0.2998 


0.3035 


0.5776 


0.3050 


0.3449 


std 


0.0249 


0.0259 


0.0231 


0.0383 


0.0393 


true value 


0.0980 


0.0980 


£,3(2.50) 
0.0980 


0.0974 


0.0980 


mean 


0.0977 


0.0994 


0.3753 


0.1024 


0.1312 


std 


0.0158 


0.0164 


0.0265 


0.0261 


0.0297 


true value 


0.0344 


0.0344 


£0(3.75) 
0.0344 


0.0344 


0.0344 


mean 


0.0342 


0.0350 


0.2544 


0.0374 


0.0536 


std 


0.0084 


0.0087 


0.0249 


0.0142 


0.0179 



Note: In all simulated scenarios T = 1200 and [1/A n ] = 78. The mean and the standard de- 
viation (across the Monte Carlo replications) correspond to the estimator £0(11) (the first 
three columns) or £^(u) (the last two columns). The estimator j3 is computed using (3.17) 
and the first 252 "days" of the sample. Vt has Gamma marginal law with corresponding 
Laplace transform of (1 + u * 0.05 2 /0.04)-°' 04/0 ' 052 . The Monte Carlo replica is 1000. 



5. An application to parametric estimation of the stochastic scale law. 

We apply the preceding theoretical results to define a criterion for para- 
metric estimation based on contrasting our nonparametric realized Laplace 
transform to that of a parametric model for the stochastic scale (or the time 
change) . 

Theorem 5. Suppose the conditions of Theorem 3 are satisfied. Let the 
Laplace transform of \<Jt\^ be given by £p(u;0) for some finite- dimensional 
parameter vector lying within a compact set £ with 6q denoting the true 
value and further assume that £p(u\9) is twice continuously- differentiable 
in its second argument. If Q l is some local neighborhood of 6q, assume 
svLpg e Qi{\V$£p(u; 6)\ +V 6QiCp(u\8)\} bounded. Suppose for a kernel func- 
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tion with bounded support k : M+ — > M+ we have that 

(C/3(u; 9) - Cp{u; 9 )) 2 k(u) du>0, 9^ 9 . 
Define the estimator 

(5.1) 9 = argmin / (£p(u) — £/3(u;9)) 2 k(u) du, 

6»ee Jr + 

where k is a nonnegative estimator of k with (u 2+t V 1) sup ugK+ \k(u) — 
k(u)| for some t > 0. Then for T — > oo and TA n — > 0, we have 

(5.2) VT(9-9o)^^ V 6 £p(u;9)V 6 £f3(u;9y K ( U )du^ E^E', 
where E' is a standard normal vector and 

(5.3) H= / / Y,p(u,v)V e£p(u;9)V e£p{v]9)' k(u)k{v) dudv, 

Jr+ Jr+ 

for T,p(u,v) the variance- covariance of Theorem 3. 

Remark 4. There are two types of pure-jump models used in practice. 
First are the time-changed Levy processes; see, for example, [11] and [7]. 
As explained in Remark 2, the time-change at corresponds to \o~t\^ in (1.1). 
Therefore, Cp{u) provides an estimate of the Laplace transform of the time- 
change which is modeled directly in parametric settings. The second type of 
pure-jump models are the ones specified via Levy-driven SDE. In this case 
we typically model at and not \o~t\^ ■ Therefore, to apply Theorem 5 in this 
case one will need to evaluate Cp{u\9) via simulation. In both cases, the 
use of RLT simplifies the estimation problem significantly, as it preserves 
information about the stochastic scale and, importantly, is robust to any 
dependence between at and Zt, which, particularly in financial applications, 
is rather nontrivial. 

The theorem was stated using Cp{u), but obviously the same result will 
apply if we replace it with C^(u). By way of illustration, we apply the theory 
to the VIX index computed by the Chicago Board of Options Exchange; the 
VIX is an option-based measure of market volatility. The data set spans the 
period from September 22, 2003, until December 31, 2008, for a total of 1212 
trading days. Within each day, we use 5-minute records of the VIX index 
corresponding to 78 price observations per day. [28] present nonparametric 
evidence indicating that the VIX is a pure-jump ltd semimartingale. 

The underlying pure-jump model we consider for the log VIX index, de- 
noted by vt, is 

dvt = atdt+ I I fl(dx,dt), 
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where at is the drift term capturing the persistence of vt, and p, is a ran- 
dom integer- valued measure that has been compensated by a t dt®v{dx) 
for at a stochastic process capturing time varying intensity. The martin- 
gale component of vt is a time-changed Levy process, as in [11]. Recall that 
the time-change at corresponds to | cr^ | ^ in the general model (1.1), and our 
interest here is in making inferences regarding its marginal distribution. 

The parametric specification we use for the marginal distribution of the 
time-change is that of a tempered stable subordinator [24], which is a self- 
decomposable distribution, that is, there is an autoregressive process of order 
one that generates it [25]. The Laplace transform of the tempered stable is 



where 6 = (a c A) is the parameter vector, T(—a) = — -T(l — a) for a G (0, 1), 
T denotes the standard Gamma function, a S [0,1) can be interpreted as 
the activity index of the time-change at, c is the scale of the marginal 
distribution of at and A governs the tail. 

To make the estimation feasible, we need an estimate of /3 and to further 
specify the kernel k of Theorem 5. For /3 we use the estimator defined in equa- 
tion (3.17) over the first year of the sample, exactly as in the Monte Carlo 
work; the point estimate is /3 = 1.862 with standard error 0.034. We next 
follow [23] in using a Gaussian kernel k(u) = exp(— 2it 2 /u^ iax ) where u max is 
defined via V u £g(ii max , #o) = —0.05. The point u max is set so that we col- 
lect most of the information available in the empirical Laplace transform. 
The feasible kernel k(u) is constructed from the infeasible by replacing it max 
with a consistent estimator for it. It is easy to verify that this choice of the 
kernels satisfies the conditions of Theorem 5 above. 

Table 2 shows the parameter estimates and asymptotic standard errors 
based on this feasible implementation of (5.1)-(5.3). Interestingly, a is es- 
timated to be below that associated with the inverse Gaussian (a = 1/2), 
while the estimated tail parameter A suggests relatively moderate dampen- 
ing, but this parameter appears somewhat difficult to estimate with high 
precision, given the time span of our data set. Figure 1 shows the fit of 




ex.p{cT(-a)[{\ + u) a - \ a }}, if a G (0,1) 



(5.4) 



Table 2 
Estimation results 



Parameter 



Estimate 



Standard Error 



Q 

c 
A 



0.2651 
1.2872 
0.0377 



0.0453 
0.0469 
0.0103 
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0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 



Fig. 1. The solid line is the fitted parametric Laplace transform of the marginal distri- 
bution of the scale of the log VIX index, which essentially plots on top of the estimated 
realized Laplace transform (not visible), and the dashed lines show 95 percent nonparamet- 
ric confidence intervals about the estimated realized Laplace transform. 

the model. Specifically, the heavy solid line is the model-implied Laplace 
transform evaluated at the estimated parameters. It plots on top of the 
(not visible) realized Laplace transform, (3.1), and thereby passes directly 
through the center of the (nonparametric) confidence bands. Overall, the fit 
of the tempered stable to the marginal law of the time-change is quite tight. 
From this point, one can follow the strategy of [5] and go further to develop 
a dynamic model for the time-change by coupling the fitted marginal law 
with a specification for the memory of the process. 

6. Conclusion. We derive the asymptotic properties of the realized La- 
place transform for pure-jump processes computed from high-frequency data. 
The realized Laplace transform is shown to estimate the Laplace transform 
of the stochastic scale of the observed process. The results are (locally) uni- 
form over the argument of the transform. We can thereby also derive the 
asymptotic properties of parameter estimates obtained by fitting parametric 
models for the marginal law of the stochastic scale to the realized Laplace 
transform. This estimation entails minimizing a measure of the discrepancy 
between between the model-implied and observed transforms. 
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7. Proofs. Here we give the proof of the main results in the paper: 
Lemma 1 and Theorems 1 and 3, with the rest relegated in the supple- 
mentary Appendix. In all the proofs we will denote with C a constant that 
does not depend on T and A n , and further it might change from line to line. 
We also use the shorthand for E(-|^(i_i)A„)- We start with stating 
some preliminary results, proofs of which are in the supplement, and which 
we use in the proofs of the theorems. 

7.1. Preliminary results. For a symmetric stable process with Levy mea- 
sure ujf+i dx for some c> and f3 £ (1,2), using Theorems 14.5 and 14.7 
of [25], we can write its characteristic function at time 1 as 

( r°° dr f°° dr \ 

expfc J (e lur - 1 - iur)-^ + c J (e~ lur - 1 + iur)-^\ , iiGl. 

Then using Lemma 14.11 of [25], we can simplify the above expression to 

exp(2cr(-/3)cos(/3vr/2)|n| /3 ), 

where F(—/3) = grgzf] f° r P £ (1)2). Therefore, the Levy measure of a j3- 
stable process, L t , with E(e~ iuLt ) = e -*l"l /3 / 2 ) i s 

A X jxp+i dX ' 

for A defined in (2.3). Throughout, after appropriately extending the original 
probability space, we will use the following alternative representation of the 
process Xt (proof of which is given in the supplement): 

X t = X + / a s ds + / / a s -xjli(ds,dx) 
Jo Jo Jr 

.xfi 2 (ds,dx) - / / a s _xfi3(ds,dx) + Y t , 
Jo Jr 

where fi±, ^2 and [13 are homogenous Poisson measures (the three measures 
are not mutually independent), with compensators, respectively, u\{dx) = 
nA+i dx, Vi(dx) = \v'(x)\ dx and v$(dx) = 2|z/(x)| l(y'(x) < 0) dx and a s = 

a s — a s - j R xv'{x) dx. Finally, to simplify notation we will also use the short- 
hand L t = Jq f R xjli(ds,dx) and further for a symmetric bounded function k 
with k{x) = x for x in a neighborhood of zero, we decompose 

Lt = L t + L t , 

(7.1) 

L t = / (x — n(x))p,i(ds, dx), L t = / n(x)fli(ds,dx). 



rt 
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With this notation we make the following decomposition: 

r-T 



(7.2) 



V T (X,A n ,(3,u)- [ e~ u ^dt 
Jo 



[T/A n ] 3 

1=1 j=l 



- u ^dt, 



[T/A n ]A„ 



iA n 



(i-l)A„ 



= / ( e -«l^-i)A„-l^_ e -«k s | 5 )dS; 

J(i-l)A n 



= A n (cos((2 U ) 1 ^ A - 1 ^A"X) - cos((2 U ) 1 /^ ( ._ 1)An _ A^^ A ?L)). 



Starting with using the self-similarity of the stable process Lt, and 
the expression for its characteristic function, we have 

-V p A£L) - e -«h>-i)A„-l' 3 ) = o, 



(7.3) 



E^_ 1 (cos((2n) 1 /' 3 



(i-l)An- 



Et 1 (cos((2n) 1 /^ {i _ 1)An _A^ 1//3 Af L) - e^-DAn-l') 2 
= ^( U 1 //3| fT(j _ 1)A _| )) 

_ Et 1 (cos((2n) 1 ^ (T{i _ 1)An _A r 7 1//3 Af L) - e -«k(*-DAn-l / ') 4 < C. 
Using first-order Taylor expansion we decompose £>J = X,7=i£iuu) wnere 



I I / (r u dW u + / / <5(u— ,x)p,(du,dx) ) (is, 

J(i-l)A n \J(i-l)A n J(i-l)A n JR ) 



^2(2) 



(i-l)A„ V(i-1)A 
«A n 

(T(o-*,n) - T(o- (i _ 1)An _ u )) 

(i-l)An 



(i-l)A* 



iA* 



(i-l)A* 



(»-l)A„ 
)ds, 



S(u—,x)jl(du,dx) I (is 



where T(x,u) = (3 sign{x}u\x\P 1 e U N' 8 , <j* is a number between <7(j_i)A n - 
and a*, and 



5 S = (T(j_i)A n _ + / ct u dW u + 

■/(i-l)A„ 



(i-l)A* 



5(u—,x)£i(du, dx), 

se[(i-l)A n ,iA n ]. 
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We derive the following bounds in the supplement for any finite u > 

[T/An] 

E E?-i«$(l)) = °, 



(7.4) 



8=1 



T 



/[T/A„] 

t(2)/ 



*-e^ e ^(egci)) 2 ) <c, 

[T/A„ 



(7.5) (TA^ 2 )- 1 £ E ( SU P 1^2(2)1) <C, 

P7A„] 



(7-6) (TA„r x £ E supj4 2 j(3)| <C. 



0<U<M 
2=1 — — 



(3) 

Turning to Qj,, we can first make the following decomposition [recall the 
decomposition of Lt in (7.1)]: 



l 



(7.7) cos(xi) - cos(x5) = ^2[cos(xj) - cos(xj+i)L 

i=i 

/ riA„ riA n \ 

X2 = (2u) l ^A- l ^[ a s ds+ <Js-dL s ), 

Vi(i-1)A„ J(i-l)A n J 

Xs = (2^) 1 / /3 A- 1 //3 A n a (j _ 1)An + / a s _ dL s , 

V J(i-l)A n / 

X4 = (2^) 1 ^A; 1 //3 (J(i _ 1)An _ A n£ ) 
X5 = (2 U ) 1 ^A; 1 //3 (J(i _ 1)Ara _ A n L . 

Then, using the formula for cos(:e) — cos(y), x,y S M for the first bracketed 
term on the right-hand side of (7.7) and a second-order Taylor expansion 

for the third one, allows us to write Q u = Ylj=i^iuU) where 

eg(i)=-2A„ 



sin 0.5(2u) 1 / /3 A" 1 / /3 A"A + / a s ds + <r s -dL s ) 

V V J(i-l)A n J(i-l)A n J 

/ / riA n riA n \ 

sin 0.5(2m) 1 / /3 A" 1 / /3 A^A- / a s ds- <7 s -dL s ) 



24 



V. TODOROV AND G. TAUCHEN 



^ (2) = -(2 tI ) 1 %rW S m((2«) 1 ^ (l _ 1)A „_A n - 1 %?I)a (i _ 1)A „ 
^ (3) = -(2.) 1 ^ A rV/3 sin((2n) i//3 CJ(j _ i)An _ A -i//3 A n Z) 

/•£A„ _ 

X / (<T s _-(7( i _i)A tl -)(iL s , 
J(t-1)A„ 

Ci,tt( 4 ) = A n[cOs(x2) - C0S(X3)] + A n [cOs(x4) ~ COs(x 5 )], 



d?(5) = 0.5(2n)^A^ 



2//3 



x cos(x) A n a (i _ 1)An + / (cr s „ - cr ( i_ 1)An _) (iL s , 

V J(i-l)A n J 

with x denoting some value between xs an d X4- 

We derive the following bounds in the supplement for any finite u > 0: 



(7.8) 



[T/A.., 

(r|log(A n )|A^'^)-i ^ E 



/ A 3-2//3s_ x /[T/A„] 

(7-9) E?_ 1 (eg(2))=0, (A "^ } E> ^ 



, sup ieS(i)i)<c, 

^=1 — — 



(7.10) 
(7.11) 
(7.12) 



[T/Ar. 

(TA 3/2-l/^-l £ ] 
i=l 



E C-i(eg(2)) 2 <c, 

i=l / 
^ E( sup |eg(4)|)<C, 



[T/A n ] 

(TA 2-2/^-l £ E gup |eg) (5)|) <C, 

2 = 1 — — 



sup 

0<U<U 



[T/A n ] 

E E ?-i^( 3 ) 



1=1 



(7.13) (TAy^ v/? " +4 ))~ 1 E( 



I sup 

\ 0<«<« 



<C 

under Assumption B', 

[T/A„] \ 

E E r-!eg(3) 



i=l 



(7.14) 



[T/A n ] 

(rA 3-2/^-i ^ E sup | C g(3)| 2 )<a 

2=1 — — 



< c 

under Assumption B, 
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7.2. Proof of Lemma 1. Since hr 1 '^ 'Zht is a Levy process, to prove the 
convergence of the sequence, we need to show the convergence of its charac- 
teristics (see, e.g., [16], Corollary VII. 3. 6); that is, we need to establish the 
following for h — > 0: 

h i -w ! K 2 {h-yp X )v{x)dx- 



(7.15) 



K 2 (x 



dx 



9(x)-, 



A 



1/3+1 



dx, 



dx, 



where g is an arbitrary continuous and bounded function on R, which is 
around 0. 

The result in (7.15) follows by a change of variable in the integration, and 
by using the fact that by Assumption A we have |^'(x)| < for \x\ < xq 

where xq is fixed and (3' < f3. 

7.3. Proof of Theorem 1. Part (b) of the theorem holds from the bounds 
in (7.3)-(7.6) and (7.8)-(7.14), so we are left with showing part (a). First, 

we show that for A n — > 0, -7= Ylt=i"^ converges stably as a process in t 
for the Skorokhod topology to the process 

Jo ^ F p{ ul/l3 Ws\)dW' s , where W[ 
is a Brownian motion defined on an extension of the original probability 
space and is independent from the cr-field T '. Using the result in (7.3), we 
get for every t > 0, 

[t/A n ] 

0-h 



A, 



VP 



(T, 



\ ds, 



0. 



i=l 
1 [t/An] 

^ E ^i(0 2 

i=l 

, It/An] 

^ E ^& 

n i=i 

where for the second convergence above, we made use of Riemann integra- 
bility. Thus to show the stable convergence, given the above result and upon 
using Theorem IX. 7. 28 of [16], we need to show only 

(7.16) E K-i(Vy^^?M) A Vt > 0, 



i=l 



where M is a bounded martingale defined on the original probability space. 

When M is discontinuous martingale, we can argue as follows. First, we 
can set M? = M [t/An]An and N? = £fif " ] 07^2 for any t. With this 
notation we have [M n ,N n ] 



Ei=^(N/V^^«A?M) and <M",i\P) 
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El=i E ?-i(VV A n4«AfM). We trivially have that M n converges (for 
the Skorokhod topology) to M, and furthermore from the results above the 
limit of N n , which we denote here with N, is a continuous process. There- 
fore, using Corollary VI.3.33(b) of [16], we have that (M n ,N n ) is tight. 
Then, using the fact that M is a bounded martingale (and hence it has 
bounded jumps), we can apply Corollary VI. 6. 29 of [16] and conclude that 
the limit of [M n ,N n ] (up to taking a subsequence) is [M, N]. However, 
since continuous and pure-jump martingales are orthogonal (see, e.g., Def- 
inition 1.4.11 of [16]), we conclude that [M, N) =0. Further, the difference 
[M n ,N n ] — (M n ,N n ) is a martingale, and using Ito isometry, the fact that 

y/ 1 / An^ 1 ^ < Cy/A n , and the boundedness of M, we have 
E([M n ,N n ] t - (M n ,N n ) t ) 2 = e(Y^( AM s A K) 2 ) 

<CA n E(^(AM s n ) 2 )<CA n . 

Therefore, [M n ,N n ] — (M n ,N n ) converges in probability to zero, and hence 
so does (M n ,N n ). 

When M is a continuous martingale, we can write E"_ 1 (^-^A"M) = 
E"_i(A^jVA"Af) where now we denote N t = \& t ) for te[(i- 1) A n , iA n ] 

(which is obviously a martingale with respect to the filtration &t). However, 

note that is uniquely determined by &u-\\& n and the homogenous Pois- 
son measure Therefore, Nt remains a martingale for the coarser filtra- 
tion = «^(i_i)A n C '^t 1 f° r '^t 1 being the filtration generated by the 
jump measure jjL\. Then using a martingale representation for the martin- 
gale (iV*)t>(i_ i)A n with respect to the filtration J^" t * (note \i\ is a homoge- 
nous Poisson measure), Theorem III. 4. 34 of [16], we can represent Nt as 
a sum of =^(i-i)A n -adapted variable and an integral, with respect to p,\. But 
then since pure-jump and continuous martingales are orthogonal, we have 
E?_i($A?M) = 0. 

This establishes the stable convergence of A El— ^ Ci}, ■ Next, the 

bounds for ^ and fj-JJ in (7.4)-(7.6) and (7.8)-(7.11) imply that 

-77f EEl^Cfi? is asymptotically negligible for A n -> 0, and T 

fixed. 

7.4. Proof of Theorem 3. Part (a). The proof consists of showing finite- 
dimensional convergence in u and tightness of the sequence: 

(1) Finite- dimensional convergence. First, given Assumption C, and using 
a CLT for stationary and ergodic process (see [16], Theorem VIII. 3. 79), we 
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have for a finite-dimensional vector u, 

(7.17) Vt(± £ e-l^ dt - C p (u)\ A % 

where ^ is a zero-mean normal variable with elements of the variance- 
covariance matrix given by Y,p(ui, Uj). 

Next, the results in Section 7.1 imply for T — > oo and A n — > under the 
weaker Assumption B. 

! [T/A n ] [T/A n ] 

j= y: = o P ( VA n) , j= y: eg = o p {vt^ v v^), 

(7.18) i=1 

n PT/An] 

■4 E 42 = log(A n )|Al^'//3 V A (2-2/«Al/ 2) v ^ 

^ i=l 



with the last one replaced with the weaker 

, PT/An] 

4 E Cg =^(1108(^)1^-^7 A 

* i=l 



r/^w 7 ^), 



when the stronger Assumption B' holds. 

(2) Tightness. Let's denote for arbitrary u,v>0, 

Zt = (e-fH" _ ^(n)) _ ( e -«k*l' _ ^(v)). 

Then, using successive conditioning and Lemma VIII. 3. 102 in [16], together 
with the boundedness of Zt and Assumption C, we get 

2 

E| 



/ i T \ i r T r T 

\yfJ2 z * dt ) =fj J E ( z t z ») dadt 

KC^I'-v 1 !*^ C f T E(\a sAt f/PE(z sVt \^ sAt ))dsdt 
1 Jo Jo 



< c .| u i/p_ u i/p|i+ t l. £ j\ a ^ sl ) 1/3 ~ L dtds 



T . 

r 

<C\u l / p -v 1/p \ l+i / {af x ) l/3 -'ds<C\u l / p -v l/p \ l+L , 



where i > is the constant of Assumption C and p > 3. Using Theorem 12.3 
of [8], the above bound implies the tightness of the sequence A= §q {eT u \ at \ — 
Cp(u)) dt, and from here we have its convergence for the local uniform topol- 
ogy 
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Turning now to X^!=i A "^ C'u > we can use the ana l°g of the result 
in (7.3) for eg to get 

/ -, P7A„] \2 

for some constant C. From here, using Theorem 12.3 of [8], we get the 
tightness of ^= • 

/o\ /'oN 

Similarly, using the analog of (7.9) applied to Qui 2 ) ~ ( 2 )> 

we have 

A -(3-2/0) /[T/A„] \2 

(7.19) £ «g(2)-$(2))J <C|^_^|2. 

This establishes tightness for A~ (3/2 ~ 1//3) E!=i A " ] f,>(2)- We can do exactly 

the same for A ? T (3/2 - 1//3) E!=i Anl (4? ( 3 ) " E ?-i(^>( 3 ))) usin § the analo § 
of (7.14) applied to ^(3) - ^(3) -^(^(3) - ^(3)). Next, 

2 /pT/An] \ 2 

( 7 . 2 0) _JL_ E ( £ («g(l)-^?(l))J <C( U - W ) 2 , 

where we used successive conditioning and further made use of the inequality 

\T(x,u) — T(x,v)\<C\xf~ 1 \u-v\, iel,!i,«>0, 

which follows from applying first-order Taylor expansion of T(x,u) in its 
second argument and using the fact that the derivative of T(x,u) in its 
second argument is bounded by C\x\^ _1 . Therefore, Yli=^™^ * s 
tight on the space of continuous functions equipped with the local uniform 
topology. 

Next, using the results in Section 7.1, it is easy to show that for any finite 
u > 0, we have 



(7.21) lim P sup 

A ? 40,rtoo \ o< u <u 



[T/An] 

£ (TA^)- 1 eg(2) 
i=i 



> e n = Ve n t oo. 



The same holds when, in the above, we replace (TA^ 2 ) 1 Cl«(2) with ei- 
ther of the following terms: (TA n ) -1 £g(3), (T|log(A n )|Ajr^)- 1 ^(l), 

(rA 3/2-l/0 rle (3) (4)) (rA 2-2//3 rle (3) (5) M ^ „ (^"^(^(S)) 

under Assumption B' and (TA^ V ^ + ^)~ 1 IE™_ 1 (^ 3 1 ] (3)) under the weaker 
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Assumption B. This implies that those terms are, uniformly in u, bounded 
in probability. 

Part (b). First, (3.12) follows directly from Theorem 1, so here we only 
show (3.13). If we denote for k > 0, 



1 



T 



t=k+l 



then under our assumptions, by standard arguments (see, e.g., Proposition 1 
in [4]), we have 



\ — It 

(7.22) C 0> p(u,v) +2_^u}(i,LT){C i ^(u,v) + C i ^(v,u)) — > £g(u,?/)- 



i=l 



Therefore, we are left showing 

(C ,/3(u,v) - C 0> p(u,v)) 



(7.23) 



v,u 



-Ci^(u,v) -Ci^(v,u)) — >0. 
We note that for arbitrary 1 < k < T, we have 

[k/A n ] k 
A n V cos((2n) 1//3 A- 1 / /3 A"X) < 1 and / e~ u ^ ds < 1. 

• ,n TT^a i , , Jk-1 



i=[(As-l)/A„]+l 
Hence, for k = 0,1, ... , Lt, we have 



\d kJi (u,v) - C kj p(u,v)\ < i V Zt tP (v) - f e~ u ^ds 

1 t=i 



1 T 



e~ v ^ds 



t-i 



t=i 

+ \£p( u ) - £/?( u )l 

+ \Cb(v)-Cb(v)\+0 



k 
f 

First, using the CLT result in (7.17), and since Lt/VT — > 0, we have 
(7.24) ^\u;(i,L T )\\£p(u)-£p(u)\ -Ao Vu > 0. 
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Further, using the stationarity of the process at and the bounds on the 
moments of the terms £p2 derived in Section 7.1, we have for every t>l, 



E 



i-i 



< C{\ log A n |A^ /3 '/ /3 V A^ 2 /« A1 /2). 



Therefore, using the relative speed condition between Lt and A n in the 
theorem, we have 



T 



t=\ 



t-i 



Vu > 0. 



(7.24) and (7.25) imply (7.23), and this, combined with (7.22), establishes 
the result in (3.13). 
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SUPPLEMENTARY MATERIAL 

Supplement to "Realized Laplace transforms for pure-jump semimartin- 
gales" (DOI: 10.1214/12-AOS1006SUPP; .pdf). This supplement contains 
proofs of the preliminary results in Section 7.1 as well as the proofs of The- 
orem 2, 4 and 5. 
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